Revisiting Readability: A Unified Framework for Predicting Text Quality

نویسندگان

  • Emily Pitler
  • Ani Nenkova
چکیده

We combine lexical, syntactic, and discourse features to produce a highly predictive model of human readers’ judgments of text readability. This is the first study to take into account such a variety of linguistic factors and the first to empirically demonstrate that discourse relations are strongly associated with the perceived quality of text. We show that various surface metrics generally expected to be related to readability are not very good predictors of readability judgments in our Wall Street Journal corpus. We also establish that readability predictors behave differently depending on the task: predicting text readability or ranking the readability. Our experiments indicate that discourse relations are the one class of features that exhibits robustness across these two tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Qualitative and Quantitative Examination of Text Type Readabilities: A Comparative Analysis

This study compared 2 main approaches to readability assessment. Thequantitative approach applied idea density based on part of speech tagging andcompared 3 sets of text types (i.e., narrative, expository, and argumentative) withrespect to their ease of reading. The qualitative approach was done throughdeveloping questionnaires measuring intermediate EFL learners’ perceptions oncontent, motivat...

متن کامل

Exploring the Relationship Between Modality and Readability Across Different Text Types

With regard to the relationship between the use of modality and readability levels oftexts, 2 opposing views have been raised. The first view endorses direct positiverelationship between modality and readability in the sense that the use of modalityincreases textual understandability. The second view is that the use of modality leadsto an increase in the number of words, resulting in readabilit...

متن کامل

Predicting the readability of transparent text.

Will a simple global masking model based on image detection be successful at predicting the readability of transparent text? Text readability was measured for two types of transparent text: additive (as occurs in head-up displays) and multiplicative (which occurs in see-through liquid crystal display virtual reality displays). Text contrast and background texture were manipulated. Data from two...

متن کامل

An Integrated Approach for Measuring Software Quality and Code Readability

In this paper, we explore the concept of code readability and investigate its relation to software quality[1]. This is a new approach to measuring the complexity of software systems[2]. Software industry uses software metrics to measure the complexity of software systems for software cost estimation, software development control, software assurance, software testing, and software maintenance [3...

متن کامل

On Improving the Accuracy of Readability Classification using Insights from Second Language Acquisition

We investigate the problem of readability assessment using a range of lexical and syntactic features and study their impact on predicting the grade level of texts. As empirical basis, we combined two web-based text sources, Weekly Reader and BBC Bitesize, targeting different age groups, to cover a broad range of school grades. On the conceptual side, we explore the use of lexical and syntactic ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008